ABSTRACT

Stirling's formula is one of the most frequently used results from
asymptotics. It is used in probability and statistics, algorithm analysis and
physics. In this thesis we shall give a new probabilistic derivation of Stirling's
formula. Our motivation comes from sampling randomly with replacement
from a group of n distinct alternatives. Usually a repetition will occur before
we obtain all n distinct alternatives consecutively. We shall show that
Stirling's formula can be derived and interpreted as follows: as $n \to \infty$, the
expected total number of distinct alternatives we must sample before all n are
obtained consecutively is asymptotically equal to the expected number of
attempts we make to obtain all n distinct alternatives consecutively times the
expected number of distinct alternatives obtained per attempt.
I.  INTRODUCTION

A.    THE  PROBLEM 

Asymptotic  analysis  is  important  in  many  areas  of  modern  science,  such 
as  the  theory  of  probability,  complex  analysis  and  applied  mathematics. 

Because  the  factorial  function  and  its  asymptotic  behavior  are  often 
needed  in  mathematics  and  engineering,   Stirling's  formula 


$$n! \sim n^n e^{-n} \sqrt{2\pi n}, \quad (n \to \infty), \tag{1}$$


is  one  of  the  most  important  and  frequently  used  asymptotic  formulas.  The 
notation  in  (1)  means  that  the  ratio  of  the  left  side  and  the  right  side  tends  to 
one  as  n  tends  to  infinity. 

There  are  several  ways  to  prove  Stirling's  formula.  For  example,  one  can 
take the logarithm of n! and use Wallis's formula to obtain the factor of
$(\pi)^{1/2}$. For this type of proof, see [Ref. 1].

Alternatively,  one  can  start  with  the  integral  representation 


$$n! = \int_0^\infty t^n e^{-t}\, dt$$

and use Laplace's method for integrals to evaluate it asymptotically.
See [Ref. 2] for this type of approach.


All  these  methods  use  many  techniques  from  mathematical  analysis  and 
some  of  them  are  quite  sophisticated.  We  will  pursue  a  new  way  to  prove 
Stirling's  formula  using  a  discrete  or  combinatorial  approach. 

This method of proof was mentioned as a research problem in [Ref. 3],
and  the  purpose  of  this  thesis  is  to  present  a  solution  to  this  problem. 

B.   MOTIVATION 

From  (1),  Stirling's  formula  can  also  be  written  as 


$$\frac{n!}{n^n} \sim \frac{\sqrt{2\pi n}}{2} \left( \frac{e^n}{2} \right)^{-1}, \quad (n \to \infty).$$


Imagine a box filled with n distinct balls. We shall select balls at random
with replacement. The motivation for our approach comes from noting that
$n!/n^n$ is the probability of selecting n distinct balls consecutively, while
$(2\pi n)^{1/2}/2$ is the asymptotic expected number of distinct balls obtained
before a repetition as $n \to \infty$. These expressions appear in Stirling's formula
as written above, thus indicating that a combinatorial proof might be possible.
The purpose of the next section is to define the combinatorial set up in more
detail.


II.  COMBINATORIAL SET UP

A.  DEFINITION  OF  THE  GAME 

Imagine  a  box  filled  with  identical  balls  numbered  from  one  to  n.  We 
draw  a  ball  from  the  box  at  random,  write  down  its  number,  and  replace  it, 
mixing  the  balls  well  so  that  our  next  draw  is  also  made  at  random.  If  we 
continue this process we will eventually get a repetition, for there are only n
distinct balls and we must certainly repeat a number by our (n+1)st draw.

We  are  interested  in  the  task  of  drawing  out  all  n  balls  consecutively  in 
this  manner.  We  mean  by  this  that  after  n  consecutive  draws,  recording  the 
numbers  as  we  draw,  we  wish  to  obtain  a  permutation  of  the  sequence  (1,2,3, 
...  ,n).  If  we  obtain  a  repetition  before  the  desired  result,  then  we  start  over 
from  the  beginning. 

The  following  three  questions  are  of  interest: 

•  What is the average or expected number $E_n$ of distinct balls obtained
before a repetition occurs?

•  What is the average or expected total number $T_n$ of distinct balls
(adding up the number of distinct balls obtained in the first game, the
second game, etc.) selected before obtaining n consecutive distinct
balls (i.e., some permutation of (1,2,3, ..., n))?

•  What are the asymptotics of $E_n$ and $T_n$ as $n \to \infty$?
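
The first question can be explored empirically before any analysis. The following Python sketch simulates single plays of the game and averages the number of distinct balls seen before the first repetition. The function names, the number of trials and the fixed seed are illustrative assumptions and are not part of the thesis.

```python
import random


def play_once(n, rng):
    """Draw with replacement from balls 1..n until some ball repeats.

    Returns the number of distinct balls seen before the repetition.
    A return value of n means all n balls were drawn consecutively.
    """
    seen = set()
    while True:
        ball = rng.randint(1, n)  # uniform on 1..n, both ends included
        if ball in seen:
            return len(seen)
        seen.add(ball)


def estimate_expected_distinct(n, trials=20000, seed=0):
    """Monte Carlo estimate of the mean number of distinct balls per play."""
    rng = random.Random(seed)
    return sum(play_once(n, rng) for _ in range(trials)) / trials
```

For n = 20 the estimate should land near 5.3, which is the exact value the thesis reports in its conclusion; the Monte Carlo error with 20000 trials is roughly 0.02.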

B.  THE  AVERAGE  OR  EXPECTED  NUMBER  OF  DISTINCT  BALLS  BEFORE 
A  REPETITION 

Let $p_j$ be the probability that we get exactly j distinct balls before a
repetition. In other words, since only j distinct balls are obtained, $p_j$ is the
probability of getting a repetition on the (j+1)st draw, and not before.


Since we assume each draw is independent, the probability of drawing a
specific ball at any time is just 1/n. We always obtain at least one distinct ball
in any play of the game. To obtain exactly one distinct ball in a game we must
get a repetition on our second draw. The probability of doing this is 1/n and
therefore the probability of obtaining only one distinct ball is

$$p_1 = \frac{1}{n}.$$

The probability $p_1$ can also be written as

$$p_1 = \frac{n-0}{n} \cdot \frac{1}{n}.$$


To  draw  two  distinct  balls  before  a  repetition  we  must  get  distinct  balls 
on  both  the  first  and  second  draws.  The  third  draw  must  then  be  a  repetition 
of  either  the  first  or  second  draw.  Thus  we  are  looking  for  a  three-tuple 
where  the  first  two  elements  are  distinct  and  the  third  element  is  a  repetition 
of  either  the  first  or  second  element.  Since  all  elements  come  from 
{1,2,3, ..., n}, there are a total of $n^3$ three-tuples, with only $2n(n-1)$ meeting
the above requirement. Therefore,

$$p_2 = \frac{n-0}{n} \cdot \frac{n-1}{n} \cdot \frac{2}{n}.$$

In the same way,

$$p_3 = \frac{n-0}{n} \cdot \frac{n-1}{n} \cdot \frac{n-2}{n} \cdot \frac{3}{n},$$

and clearly

$$p_j = \frac{n-0}{n} \cdot \frac{n-1}{n} \cdot \frac{n-2}{n} \cdot \frac{n-3}{n} \cdots \frac{n-j+1}{n} \cdot \frac{j}{n}, \quad (j = 1, 2, 3, \ldots, n).$$


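Let us check that the sum of all probabilities is one, i.e.,

$$\sum_{j=1}^{n} p_j = 1.$$

Since

$$p_j = \frac{n-0}{n} \cdot \frac{n-1}{n} \cdot \frac{n-2}{n} \cdot \frac{n-3}{n} \cdots \frac{n-j+1}{n} \cdot \frac{j}{n} = \frac{j\, n (n-1) \cdots (n-j+1)}{n^{j+1}},$$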


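setting j = n gives

$$p_n = \frac{n-0}{n} \cdot \frac{n-1}{n} \cdot \frac{n-2}{n} \cdot \frac{n-3}{n} \cdots \frac{n-n+1}{n} \cdot \frac{n}{n} = \frac{n!}{n^n}. \tag{2}$$

For j = n-1,

$$\begin{aligned}
p_{n-1} &= \frac{n-0}{n} \cdot \frac{n-1}{n} \cdot \frac{n-2}{n} \cdot \frac{n-3}{n} \cdots \frac{n-(n-1)+1}{n} \cdot \frac{n-1}{n} \\
&= \frac{n-1}{n} \cdot \frac{n-2}{n} \cdot \frac{n-3}{n} \cdots \frac{2}{n} \cdot \frac{n-1}{n} \\
&= \frac{(n-1)\,(n-1)!}{n^{n-1}} \\
&= (n-1)\, \frac{n!}{n^n},
\end{aligned}$$

and for j = n-2,

$$\begin{aligned}
p_{n-2} &= \frac{n-1}{n} \cdot \frac{n-2}{n} \cdot \frac{n-3}{n} \cdots \frac{n-(n-2)+1}{n} \cdot \frac{n-2}{n} \\
&= \frac{n-1}{n} \cdot \frac{n-2}{n} \cdot \frac{n-3}{n} \cdots \frac{3}{n} \cdot \frac{n-2}{n} \\
&= \frac{(n-2)\,(n-1)!}{2\, n^{n-2}} \\
&= \frac{n(n-2)}{2} \cdot \frac{n!}{n^n}.
\end{aligned}$$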

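In general, for $0 \le j \le n-1$,

$$\begin{aligned}
p_{n-j} &= \frac{n-1}{n} \cdot \frac{n-2}{n} \cdot \frac{n-3}{n} \cdots \frac{n-(n-j)+1}{n} \cdot \frac{n-j}{n} \\
&= \frac{(n-j)(n-1)(n-2) \cdots (j+1)}{n^{n-j}} \\
&= \frac{(n-j)(n-1)(n-2) \cdots (j+1)}{n^{n-j}} \cdot \frac{n^j\, j!}{n^j\, j!} \\
&= \frac{n^{j-1}(n-j)}{j!} \cdot \frac{n!}{n^n}.
\end{aligned}$$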


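We shall now sum the $p_j$ from $p_n$ to $p_1$ in reverse order, beginning
with

$$p_n + p_{n-1} = [1 + (n-1)]\, \frac{n!}{n^n} = n\, \frac{n!}{n^n}.$$

Continuing,

$$\begin{aligned}
p_n + p_{n-1} + p_{n-2} &= \left[ n + \frac{n(n-2)}{2} \right] \frac{n!}{n^n} \\
&= \frac{2n + n^2 - 2n}{2} \cdot \frac{n!}{n^n} \\
&= \frac{n^2}{2} \cdot \frac{n!}{n^n}
\end{aligned}$$

and

$$\begin{aligned}
p_n + p_{n-1} + p_{n-2} + p_{n-3} &= \left[ \frac{n^2}{2} + \frac{n^2(n-3)}{3!} \right] \frac{n!}{n^n} \\
&= \frac{3n^2 + n^3 - 3n^2}{6} \cdot \frac{n!}{n^n} \\
&= \frac{n^3}{6} \cdot \frac{n!}{n^n}.
\end{aligned}$$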


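We claim that

$$p_n + p_{n-1} + \cdots + p_{n-j} = \frac{n!}{n^n} \cdot \frac{n^j}{j!}, \quad (0 \le j \le n-1).$$

We can prove this by induction. Assume

$$p_n + p_{n-1} + \cdots + p_{n-k} = \frac{n!}{n^n} \cdot \frac{n^k}{k!}$$

for some k < n-1. This is the induction hypothesis. It holds for k = 1, 2, and 3.
Then

$$\begin{aligned}
p_n + p_{n-1} + \cdots + p_{n-k} + p_{n-(k+1)} &= \frac{n!}{n^n} \cdot \frac{n^k}{k!} + \frac{n^k (n-k-1)}{(k+1)!} \cdot \frac{n!}{n^n} \\
&= \frac{n!}{n^n} \cdot \frac{n^k}{k!} \left[ 1 + \frac{n-k-1}{k+1} \right] \\
&= \frac{n!}{n^n} \cdot \frac{n^k}{k!} \cdot \frac{n}{k+1} \\
&= \frac{n!}{n^n} \cdot \frac{n^{k+1}}{(k+1)!}.
\end{aligned}$$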


This proves

$$\sum_{k=0}^{j} p_{n-k} = p_n + p_{n-1} + \cdots + p_{n-j} = \frac{n!}{n^n} \cdot \frac{n^j}{j!}, \quad 0 \le j \le n-1. \tag{3}$$

Equation (3) now implies

$$\sum_{k=1}^{n} p_k = \sum_{j=0}^{n-1} p_{n-j} = \frac{n!}{n^n} \cdot \frac{n^{n-1}}{(n-1)!} = 1.$$
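
The product formula for $p_j$ and the tail-sum identity (3) can be checked with exact rational arithmetic. The sketch below is only an illustration; the function names are assumptions introduced here, and the check covers a small n rather than proving anything.

```python
from fractions import Fraction
from math import factorial


def first_game_probs(n):
    """Return [p_1, ..., p_n] for the first game.

    p_j = (n/n)((n-1)/n)...((n-j+1)/n) * (j/n): j distinct draws
    followed by a repetition on draw j+1.
    """
    probs = []
    all_distinct = Fraction(1)  # probability the first j draws are distinct
    for j in range(1, n + 1):
        all_distinct *= Fraction(n - j + 1, n)
        probs.append(all_distinct * Fraction(j, n))
    return probs


def tail_sum_formula(n, j):
    """Right side of (3): (n!/n^n) * (n^j / j!)."""
    return Fraction(factorial(n), n ** n) * Fraction(n ** j, factorial(j))


probs = first_game_probs(6)
print(sum(probs))  # the probabilities sum to exactly 1
```

Because `Fraction` is exact, any mismatch between the partial sums and (3) would show up as a failed equality rather than a rounding difference.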


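What is the expected number $E_n$ of distinct balls obtained before a
repetition? This expression is

$$\begin{aligned}
E_n = \sum_{k=1}^{n} k\, p_k &= p_1 + 2p_2 + 3p_3 + \cdots + n p_n \\
&= (p_1 + p_2 + p_3 + \cdots + p_n) + (p_2 + p_3 + \cdots + p_n) + (p_3 + \cdots + p_n) + \cdots + p_n \\
&= 1 + (1 - p_1) + (1 - p_1 - p_2) + \cdots + (1 - p_1 - p_2 - \cdots - p_{n-1}).
\end{aligned}$$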


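From (3),

$$\begin{aligned}
1 - p_1 - p_2 - \cdots - p_j &= p_{j+1} + p_{j+2} + \cdots + p_n \\
&= \sum_{k=0}^{n-(j+1)} p_{n-k} \\
&= \frac{n!}{n^n} \cdot \frac{n^{n-(j+1)}}{(n-(j+1))!} \\
&= \frac{n-1}{n} \cdot \frac{n-2}{n} \cdots \frac{n-j}{n}, \quad (1 \le j \le n-1).
\end{aligned}$$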


Thus

$$E_n = 1 + \frac{n-1}{n} + \frac{n-1}{n} \cdot \frac{n-2}{n} + \cdots + \frac{n-1}{n} \cdot \frac{n-2}{n} \cdots \frac{1}{n}. \tag{4}$$
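
As a sanity check on (4), the expected value can be computed both from the definition $\sum k\,p_k$ and from the series of partial products. This is a minimal sketch with hypothetical function names, using exact fractions so that the two results can be compared for equality.

```python
from fractions import Fraction


def expected_distinct_from_probs(n):
    """E_n computed as the sum of k * p_k, using the product formula for p_k."""
    total = Fraction(0)
    all_distinct = Fraction(1)  # probability the first k draws are distinct
    for k in range(1, n + 1):
        all_distinct *= Fraction(n - k + 1, n)
        total += k * all_distinct * Fraction(k, n)
    return total


def expected_distinct_from_series(n):
    """E_n computed from (4): 1 + (n-1)/n + (n-1)(n-2)/n^2 + ..."""
    total = Fraction(0)
    term = Fraction(1)
    for k in range(n):
        total += term
        term *= Fraction(n - k - 1, n)  # next partial product
    return total


print(expected_distinct_from_series(3))  # 17/9
```

For n = 3 both routes give 1 + 2/3 + 2/9 = 17/9.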


C.  A RELATED GAME

Now  we  consider  a  related  game  to  simplify  our  later  analysis.  The  rules 
for  this  new  game  are  as  follows:  initially,  we  start  with  n  distinct  balls.  Now, 
however,  when  we  select  a  distinct  ball  and  replace  it,  we  also  add  a  new  ball 
numbered  differently  from  all  previous  balls.  For  example,  suppose  we  have 
just  selected  the  kth  distinct  ball.  Before  our  next  draw,  we  replace  the  kth 
ball  and  add  a  new  ball  numbered  n+k,  so  that  our  next  draw  will  be  from  a 
pool  of  n+k  equally  likely  distinct  balls.  Thus,  each  time  we  draw  out  a 
distinct  ball,  the  number  of  balls  in  the  box  increases  by  one. 

This  game,  like  the  previous  one,  ends  when  we  get  a  repetition. 
However,  unlike  the  first  game,  now  it  is  possible  in  principle  to  obtain 
arbitrarily  many  distinct  balls. 
When the game ends, we empty the box and start over with the original n
balls.

Now, what is the expected number of distinct balls $E_n^*$ obtained when
playing this second game?

Let $p_j^*$ be the probability that we draw out j distinct balls, i.e., a repetition
occurs on the (j+1)st draw, and not before.

Thus $p_1^*$ is the probability that a repetition occurs on the second draw.
Since we always get a distinct ball on the first draw, after the first draw, the
box has n+1 balls. Therefore,

$$p_1^* = \frac{n}{n} \cdot \frac{1}{n+1} = \frac{1}{n+1}.$$

In the same way, $p_2^*$ is the probability that a repetition happens on the
third draw, and not before. Therefore the first and second draws yield
distinct balls so that after the first draw there are n+1 balls in the box, and
after the second draw there are n+2 balls in the box. The probability of getting
a distinct ball on the second draw is n/(n+1), and in order to get a repetition
on the third draw, we have only two choices out of n+2 balls in the box.
Therefore

$$p_2^* = \frac{n}{n} \cdot \frac{n}{n+1} \cdot \frac{2}{n+2}.$$

In the same way we find

$$p_3^* = \frac{n}{n} \cdot \frac{n}{n+1} \cdot \frac{n}{n+2} \cdot \frac{3}{n+3},$$

$$p_j^* = \frac{n}{n} \cdot \frac{n}{n+1} \cdot \frac{n}{n+2} \cdot \frac{n}{n+3} \cdots \frac{n}{n+j-1} \cdot \frac{j}{n+j}.$$


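Again we need to show that the sum of all probabilities is one. We argue
as follows.

$$p_1^* = \frac{n}{n} \cdot \frac{1}{n+1} = 1 - \frac{n}{n+1},$$

$$\begin{aligned}
p_1^* + p_2^* &= \left( 1 - \frac{n}{n+1} \right) + \left( \frac{n}{n} \cdot \frac{n}{n+1} \cdot \frac{2}{n+2} \right) \\
&= 1 - \frac{n}{n+1} + \frac{n}{n+1} \cdot \frac{2}{n+2} \\
&= 1 - \frac{n}{n+1} \left( 1 - \frac{2}{n+2} \right) \\
&= 1 - \frac{n}{n+1} \cdot \frac{n}{n+2},
\end{aligned}$$

$$\begin{aligned}
p_1^* + p_2^* + p_3^* &= 1 - \frac{n}{n+1} \cdot \frac{n}{n+2} + \frac{n}{n+1} \cdot \frac{n}{n+2} \cdot \frac{3}{n+3} \\
&= 1 - \frac{n}{n+1} \cdot \frac{n}{n+2} \left( 1 - \frac{3}{n+3} \right) \\
&= 1 - \frac{n}{n+1} \cdot \frac{n}{n+2} \cdot \frac{n}{n+3}.
\end{aligned}$$

We claim

$$\sum_{j=1}^{k} p_j^* = 1 - \frac{n^k}{(n+1)(n+2) \cdots (n+k)}.$$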

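This is true for k = 1.

Assume for the induction hypothesis that

$$\sum_{j=1}^{m} p_j^* = 1 - \frac{n^m}{(n+1)(n+2) \cdots (n+m)}.$$

Then

$$\begin{aligned}
\sum_{j=1}^{m} p_j^* + p_{m+1}^* &= 1 - \frac{n^m}{(n+1)(n+2) \cdots (n+m)} + \left( \frac{n}{n} \cdot \frac{n}{n+1} \cdot \frac{n}{n+2} \cdots \frac{n}{n+m} \cdot \frac{m+1}{n+m+1} \right) \\
&= 1 - \frac{n^m}{(n+1)(n+2) \cdots (n+m)} + \frac{n^m}{(n+1)(n+2) \cdots (n+m)} \cdot \frac{m+1}{n+m+1} \\
&= 1 - \left( \frac{n^m}{(n+1)(n+2) \cdots (n+m)} \right) \left( 1 - \frac{m+1}{n+m+1} \right) \\
&= 1 - \frac{n^{m+1}}{(n+1)(n+2) \cdots (n+m)(n+m+1)}.
\end{aligned}$$

This concludes the induction proof.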
Note that the expression

$$\sum_{j=1}^{k} p_j^* = 1 - \frac{n^k}{(n+1)(n+2) \cdots (n+k)}$$

can also be written as

$$1 - \sum_{j=1}^{k} p_j^* = \frac{n^k}{(n+1)(n+2) \cdots (n+k)}. \tag{5}$$


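Now let us consider

$$\lim_{k \to \infty} \frac{n^k}{(n+1)(n+2) \cdots (n+k)} = \lim_{k \to \infty} \frac{1}{(1 + \frac{1}{n})(1 + \frac{2}{n}) \cdots (1 + \frac{k}{n})}.$$

Since every factor in the denominator is at least one and

$$1 + \frac{k}{n} \to \infty \quad \text{as } k \to \infty,$$

it follows that

$$\lim_{k \to \infty} \frac{n^k}{(n+1)(n+2) \cdots (n+k)} = 0.$$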

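Letting $k \to \infty$ in (5), it then follows that

$$\begin{aligned}
\sum_{j=1}^{\infty} p_j^* &= \lim_{k \to \infty} \sum_{j=1}^{k} p_j^* \\
&= \lim_{k \to \infty} \left[ 1 - \frac{n^k}{(n+1)(n+2) \cdots (n+k)} \right] \\
&= 1 - \lim_{k \to \infty} \frac{n^k}{(n+1)(n+2) \cdots (n+k)} \\
&= 1.
\end{aligned}$$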


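Now we shall find the expected number $E_n^*$ of distinct balls for this
second game. The expected value $E_n^*$ is

$$\begin{aligned}
E_n^* = \sum_{k=1}^{\infty} k\, p_k^* &= p_1^* + 2p_2^* + 3p_3^* + \cdots \\
&= (p_1^* + p_2^* + p_3^* + p_4^* + \cdots) + (p_2^* + p_3^* + p_4^* + \cdots) + (p_3^* + p_4^* + \cdots) + (p_4^* + \cdots) + \cdots \\
&= 1 + (1 - p_1^*) + [1 - (p_1^* + p_2^*)] + \cdots + [1 - (p_1^* + p_2^* + \cdots + p_j^*)] + \cdots \\
&= 1 + \frac{n}{n+1} + \frac{n}{n+1} \cdot \frac{n}{n+2} + \cdots + \frac{n}{n+1} \cdot \frac{n}{n+2} \cdots \frac{n}{n+j} + \cdots.
\end{aligned} \tag{6}$$

Note that to derive (6) we have used (5).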
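
The second game can be checked the same way. The sketch below verifies the partial-sum identity (5) exactly for small cases and evaluates the series (6) in floating point. The function names and the truncation tolerance are assumptions made for this illustration.

```python
from fractions import Fraction


def second_game_prob(n, j):
    """p_j^* = (n/n)(n/(n+1))...(n/(n+j-1)) * (j/(n+j))."""
    prob = Fraction(j, n + j)
    for i in range(j):
        prob *= Fraction(n, n + i)
    return prob


def no_repeat_yet(n, k):
    """Right side of (5): n^k / ((n+1)(n+2)...(n+k))."""
    value = Fraction(1)
    for i in range(1, k + 1):
        value *= Fraction(n, n + i)
    return value


def expected_distinct_second_game(n, tol=1e-18):
    """Approximate E_n^* by summing the series (6) until terms drop below tol."""
    total, term, k = 0.0, 1.0, 0
    while term > tol:
        total += term
        k += 1
        term *= n / (n + k)  # multiply in the next factor n/(n+k)
    return total
```

A convenient special case: for n = 1 the terms of (6) are 1/(k+1)!, so the series sums to e - 1.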
D.  ASYMPTOTICS FOR THE TWO GAMES

We shall now study the expected values of the two games. No closed
form expressions appear to exist for $E_n$ and $E_n^*$ as given in (4) and (6).
However, our proof of Stirling's formula, as mentioned in the introduction,
requires the asymptotic behavior of $E_n$ as $n \to \infty$. In this section we shall
study the asymptotic behavior of $E_n$ and $E_n^*$.

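Recall that

$$E_n = \sum_{k=1}^{n} k\, p_k = 1 + \frac{n-1}{n} + \frac{n-1}{n} \cdot \frac{n-2}{n} + \cdots + \frac{n-1}{n} \cdot \frac{n-2}{n} \cdots \frac{1}{n}$$

and

$$E_n^* = \sum_{k=1}^{\infty} k\, p_k^* = 1 + \frac{n}{n+1} + \frac{n}{n+1} \cdot \frac{n}{n+2} + \frac{n}{n+1} \cdot \frac{n}{n+2} \cdot \frac{n}{n+3} + \cdots.$$

Let

$$\alpha_k = \frac{n-0}{n} \cdot \frac{n-1}{n} \cdot \frac{n-2}{n} \cdots \frac{n-k}{n}, \tag{7}$$

$$\beta_k = \frac{n}{n+0} \cdot \frac{n}{n+1} \cdot \frac{n}{n+2} \cdots \frac{n}{n+k}. \tag{8}$$

Then (4) becomes

$$E_n = \sum_{k=1}^{n} k\, p_k = \sum_{k=0}^{n-1} \alpha_k, \tag{9}$$

while (6) becomes

$$E_n^* = \sum_{k=1}^{\infty} k\, p_k^* = \sum_{k=0}^{\infty} \beta_k. \tag{10}$$

We are going to show that

$$\sum_{k=0}^{n-1} \alpha_k \sim \sum_{k=0}^{\infty} \beta_k \quad \text{as } n \to \infty,$$

or

$$E_n \sim E_n^*, \quad (n \to \infty). \tag{11}$$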


We  shall  do  this  in  several  steps  which  will  be  given  in  sections  1  through  4. 

1.      An  Inequality  for  En  and  a  Partial  Sum  of    En* 

The goal of this section is to obtain (15), an important inequality
relating $E_n$ to a partial sum of $E_n^*$.

To accomplish this, there is an inequality we shall need. It is

$$1 - x < e^{-x} < \frac{1}{1+x}, \quad x > 0. \tag{12}$$


We prove the inequality (12) as follows:

If x > 0, then -x < 0. By exponentiating both sides we get
$e^{-x} < 1$, or $1 - e^{-x} > 0$.

If x < 0, then -x > 0. By exponentiating both sides we get
$e^{-x} > 1$, or $1 - e^{-x} < 0$. Define I(x) by

$$I(x) = \int_0^x (1 - e^{-t})\, dt.$$

From the above inequalities, we conclude that

$$I(x) > 0, \quad \text{if } x \ne 0.$$

However,

$$I(x) = \left[ t + e^{-t} \right]_0^x = x + e^{-x} - 1.$$

Consequently

$$e^{-x} > 1 - x, \quad x \ne 0. \tag{13}$$


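Note that equality occurs in (13) only if x = 0. When x > 0, this
expression yields the first inequality in (12).

For the second inequality in (12), we note that (13) implies $e^x > 1 + x$,
(x > 0), or $e^{-x} < 1/(1+x)$, (x > 0). Returning to the problem, from (7) and (12) it
follows that for $0 \le k \le n-1$

$$\begin{aligned}
\alpha_k &= \left( 1 - \frac{0}{n} \right) \left( 1 - \frac{1}{n} \right) \left( 1 - \frac{2}{n} \right) \cdots \left( 1 - \frac{k}{n} \right) \\
&\le e^{-0/n}\, e^{-1/n}\, e^{-2/n} \cdots e^{-k/n} \\
&\le \frac{1}{1 + \frac{0}{n}} \cdot \frac{1}{1 + \frac{1}{n}} \cdot \frac{1}{1 + \frac{2}{n}} \cdots \frac{1}{1 + \frac{k}{n}} \\
&= \frac{n}{n+0} \cdot \frac{n}{n+1} \cdot \frac{n}{n+2} \cdots \frac{n}{n+k}.
\end{aligned}$$

Therefore,

$$\alpha_k \le e^{-\frac{k(k+1)}{2n}} \le \beta_k, \quad (0 \le k \le n-1). \tag{14}$$

Summing the inequality in (14) we conclude that

$$\sum_{k=0}^{n-1} \alpha_k \le \sum_{k=0}^{n-1} e^{-\frac{k(k+1)}{2n}} \le \sum_{k=0}^{n-1} \beta_k. \tag{15}$$

Note that (9), (10) and (15) imply

$$E_n^* \ge E_n. \tag{16}$$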
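
The sandwich (14) and its summed form (15) are easy to check numerically. The following sketch is an illustration under the definitions (7) and (8); the function name is an assumption, and floating point is used, so the check is numerical rather than exact.

```python
import math


def sandwich_terms(n):
    """Return a list of (alpha_k, exp(-k(k+1)/(2n)), beta_k) for k = 0..n-1."""
    rows = []
    alpha = beta = 1.0
    for k in range(n):
        alpha *= (n - k) / n   # running product of (1 - i/n), i = 0..k
        beta *= n / (n + k)    # running product of n/(n+i), i = 0..k
        rows.append((alpha, math.exp(-k * (k + 1) / (2 * n)), beta))
    return rows


rows = sandwich_terms(50)
print(all(a <= mid <= b for a, mid, b in rows))  # True
```

All three quantities equal 1 at k = 0 and separate as k grows, with the exponential always between the two products.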
2.  The Divergence of En and En*

We shall need the following inequality, special cases of which will be
useful later. This result is given as an exercise on page 60 of [Ref. 4].

If $u_i > -1$ for i = 1, 2, ..., m where m > 1 and $u_1, u_2, \ldots, u_m$ are all
positive or all negative, then

$$\prod_{i=1}^{m} (1 + u_i) > 1 + \sum_{i=1}^{m} u_i. \tag{17}$$

We shall prove this by induction. For m = 2 the inequality holds since

$$(1 + u_1)(1 + u_2) = 1 + u_1 + u_2 + u_1 u_2 > 1 + u_1 + u_2.$$

Assume the inequality is true for m = k, i.e.,

$$\prod_{i=1}^{k} (1 + u_i) > 1 + \sum_{i=1}^{k} u_i.$$

This is the induction hypothesis. Multiplying both sides by $1 + u_{k+1} > 0$,
we have

$$\begin{aligned}
\prod_{i=1}^{k+1} (1 + u_i) &> 1 + \sum_{i=1}^{k+1} u_i + u_{k+1} \sum_{i=1}^{k} u_i \\
&> 1 + \sum_{i=1}^{k+1} u_i.
\end{aligned}$$

The above inequality holds because all the $u_i$ have the same sign, so that
the product $u_{k+1} \sum_{i=1}^{k} u_i$ is positive.
This completes the proof.
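Since

$$\alpha_k = \left( 1 - \frac{0}{n} \right) \left( 1 - \frac{1}{n} \right) \left( 1 - \frac{2}{n} \right) \cdots \left( 1 - \frac{k}{n} \right), \quad (0 \le k \le n-1),$$

clearly

$$0 < \alpha_k \le 1, \quad (0 \le k \le n-1),$$

with $\alpha_k = 1$ only when k = 0.

We claim that

$$\left( 1 - \frac{1}{n} \right) \cdots \left( 1 - \frac{k}{n} \right) \ge 1 - \left( \frac{1}{n} + \cdots + \frac{k}{n} \right), \quad (1 \le k \le n-1).$$

For $k \ge 2$ this follows from (17) with $u_i = -i/n$ and m = k, and for k = 1 the
two sides are equal. In this case $-1 < u_i < 0$ for i = 1, 2, ..., n-1. Using the
definition of $\alpha_k$ and admitting the case k = 0, we conclude

$$1 - \left( \frac{0}{n} + \frac{1}{n} + \cdots + \frac{k}{n} \right) \le \alpha_k \le 1, \quad (0 \le k \le n-1). \tag{18}$$

Since

$$\frac{0}{n} + \frac{1}{n} + \cdots + \frac{k}{n} = \frac{k(k+1)}{2n},$$

(18) may be written as

$$1 - \frac{k(k+1)}{2n} \le \alpha_k \le 1, \quad (0 \le k \le n-1).$$

It follows that if $k^2 = o(n)$ as $n \to \infty$ (i.e., $k^2/n \to 0$ as $n \to \infty$), then

$$\alpha_k \to 1 \quad \text{as } n \to \infty.$$

So, for example, when $k \le n^{1/4}$,

$$\alpha_k \to 1 \quad \text{as } n \to \infty.$$

Since $E_n$ is at least the sum of the terms $\alpha_k$ with $k \le n^{1/4}$, and the number
of such terms grows without bound, this implies $E_n \to \infty$ as $n \to \infty$, and since

$$E_n^* \ge E_n$$

by (16), it follows that $E_n^* \to \infty$ as $n \to \infty$ too.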
Thus, in both games the expected number of distinct balls obtained
before  a  repetition  occurs  tends  to  infinity  as  the  initial  number  of  balls  in  the 
box  tends  to  infinity. 


3.  The Leading Asymptotic Contributions to En and En*

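In this section we shall show that

$$E_n \sim \sum_{k=0}^{k_n} \alpha_k, \quad E_n^* \sim \sum_{k=0}^{k_n} \beta_k, \quad (n \to \infty),$$

when

$$k_n = \left\lfloor n^{\frac{1}{2} + \varepsilon} \right\rfloor, \quad (0 < \varepsilon < \tfrac{1}{6}).$$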


This result determines the leading asymptotic contribution to $E_n$ and $E_n^*$.
In order to show this, we shall need Bernoulli's inequality:

$$(1 + x)^m > 1 + mx, \quad \text{if } m > 1,\ x > -1 \text{ and } x \ne 0. \tag{19}$$

Bernoulli's inequality is a special case of the inequality in (17).
Choose $u_i = x$ for i = 1, 2, ..., m when $x \ne 0$ and x > -1. Then (17) implies
(19).

Using (19), for any positive integer k with $2 \le k \le n$, $(1 - k/n) < (1 - 1/n)^k$,
and $(1 - k/n) = (1 - 1/n)^k$ for k = 1 or k = 0.
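Now we define

$$k_n = \left\lfloor n^{\frac{1}{2} + \varepsilon} \right\rfloor, \quad (0 < \varepsilon < \tfrac{1}{6}). \tag{20}$$

The number $k_n$ is the largest integer less than or equal to $n^{\frac{1}{2} + \varepsilon}$, i.e.,

$$n^{\frac{1}{2} + \varepsilon} - 1 < k_n \le n^{\frac{1}{2} + \varepsilon}.$$

If $k > k_n$ and k is an integer, then

$$n^{\frac{1}{2} + \varepsilon} < k_n + 1 \le k$$

so that

$$\frac{k^2}{n} > n^{2\varepsilon}, \quad \text{i.e.,} \quad \frac{-k^2}{n} < -n^{2\varepsilon}. \tag{21}$$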


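Recall that

$$E_n^* = \sum_{k=0}^{n-1} \beta_k + \sum_{k=n}^{\infty} \beta_k.$$

If $k \ge n$,

$$\beta_k = \frac{n}{n+0} \cdot \frac{n}{n+1} \cdot \frac{n}{n+2} \cdots \frac{n}{n+n} \cdot \frac{n}{2n+1} \cdots \frac{n}{2n+k-n}$$

so that

$$\beta_k \le \left( \frac{1}{2} \right)^{k-n}.$$

Consequently,

$$\sum_{k=n}^{\infty} \beta_k \le \sum_{k=n}^{\infty} \left( \frac{1}{2} \right)^{k-n} = \sum_{j=0}^{\infty} \left( \frac{1}{2} \right)^{j} = 1 + \frac{1}{2} + \frac{1}{4} + \cdots = 2. \tag{22}$$

Since $E_n^*$ diverges, it follows that $E_n^* \sim \sum_{k=0}^{n-1} \beta_k$, $(n \to \infty)$.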

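Now, for k < n,

$$\begin{aligned}
\beta_k &= \frac{n}{n+0} \cdot \frac{n}{n+1} \cdot \frac{n}{n+2} \cdots \frac{n}{n+k} \\
&= \left( 1 - \frac{0}{n+0} \right) \left( 1 - \frac{1}{n+1} \right) \left( 1 - \frac{2}{n+2} \right) \cdots \left( 1 - \frac{k}{n+k} \right) \\
&\le \left( 1 - \frac{0}{2n} \right) \left( 1 - \frac{1}{2n} \right) \left( 1 - \frac{2}{2n} \right) \cdots \left( 1 - \frac{k}{2n} \right) \\
&\le \left( 1 - \frac{1}{2n} \right)^0 \left( 1 - \frac{1}{2n} \right)^1 \left( 1 - \frac{1}{2n} \right)^2 \cdots \left( 1 - \frac{1}{2n} \right)^k.
\end{aligned}$$

The first inequality holds because n + j < 2n for $j \le k < n$, and the second holds
by (19). Since the right side of the above inequality can be written as

$$\left( 1 - \frac{1}{2n} \right)^{0 + 1 + 2 + \cdots + k}$$

and 0 + 1 + 2 + ... + k = k(k+1)/2, it follows that

$$\beta_k \le \left( 1 - \frac{1}{2n} \right)^{\frac{k(k+1)}{2}}.$$

This inequality implies

$$\begin{aligned}
\beta_k &\le \left( 1 - \frac{1}{2n} \right)^{\frac{k^2}{2}} \\
&\le \left( e^{-\frac{1}{2n}} \right)^{\frac{k^2}{2}}, \quad \text{(by (12))} \\
&= e^{-\frac{k^2}{4n}}.
\end{aligned}$$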

Since (21) implies

$$\frac{-k^2}{4n} < \frac{-n^{2\varepsilon}}{4}$$

for $k_n < k < n$, it follows that

$$\beta_k < e^{-\frac{n^{2\varepsilon}}{4}}, \quad (k_n < k < n).$$
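Now recall from (14) that when $k \le n-1$

$$\alpha_k \le \beta_k.$$

Combining the last two inequalities then yields

$$\alpha_k \le \beta_k < e^{-\frac{n^{2\varepsilon}}{4}}, \quad (k_n < k < n).$$

It then follows that

$$\sum_{k=k_n+1}^{n-1} \alpha_k \le \sum_{k=k_n+1}^{n-1} \beta_k < n\, e^{-\frac{n^{2\varepsilon}}{4}}.$$

Since $n\, e^{-n^{2\varepsilon}/4} \to 0$ as $n \to \infty$, it follows that

$$\sum_{k=k_n+1}^{n-1} \alpha_k \to 0, \tag{23}$$

$$\sum_{k=k_n+1}^{n-1} \beta_k \to 0, \tag{24}$$

as $n \to \infty$.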

Recall that

$$E_n^* = \sum_{k=0}^{\infty} \beta_k = \sum_{k=0}^{k_n} \beta_k + \sum_{k=k_n+1}^{n-1} \beta_k + \sum_{k=n}^{\infty} \beta_k.$$

Since $E_n^* \to \infty$ as $n \to \infty$ and the second and third sums in the above
expression are bounded by (24) and (22), it follows that

$$E_n^* \sim \sum_{k=0}^{k_n} \beta_k, \quad (n \to \infty). \tag{25}$$

Similarly,

$$E_n = \sum_{k=0}^{k_n} \alpha_k + \sum_{k=k_n+1}^{n-1} \alpha_k.$$

Since $E_n$ also diverges as $n \to \infty$, and the second sum above is bounded
by (23), we conclude

$$E_n \sim \sum_{k=0}^{k_n} \alpha_k, \quad (n \to \infty). \tag{26}$$
4.  The Proof that En is Asymptotic to En*

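From the above analysis, we conclude that

$$E_n \sim \sum_{k=0}^{k_n} \alpha_k, \quad E_n^* \sim \sum_{k=0}^{k_n} \beta_k, \quad (n \to \infty).$$

We shall now show that

$$\sum_{k=0}^{k_n} \alpha_k \sim \sum_{k=0}^{k_n} \beta_k, \quad (n \to \infty),$$

or from (9) and (10) that

$$E_n \sim E_n^* \quad \text{as } n \to \infty,$$

thus proving (11).

For $0 \le k \le k_n$,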


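$$\begin{aligned}
\frac{\alpha_k}{\beta_k} &= \left( \frac{n-1}{n} \cdot \frac{n+1}{n} \right) \left( \frac{n-2}{n} \cdot \frac{n+2}{n} \right) \cdots \left( \frac{n-k}{n} \cdot \frac{n+k}{n} \right) \\
&= \frac{n^2 - 1^2}{n^2} \cdot \frac{n^2 - 2^2}{n^2} \cdots \frac{n^2 - k^2}{n^2} \\
&= \left( 1 - \left( \tfrac{1}{n} \right)^2 \right) \left( 1 - \left( \tfrac{2}{n} \right)^2 \right) \cdots \left( 1 - \left( \tfrac{k}{n} \right)^2 \right) \\
&\ge \left( 1 - \left( \tfrac{k}{n} \right)^2 \right)^k \\
&\ge 1 - \frac{k^3}{n^2} \quad \text{(by Bernoulli's inequality)} \\
&\ge 1 - \frac{k_n^3}{n^2}.
\end{aligned}$$

From this analysis and (14), it follows that

$$\beta_k \left( 1 - \frac{k_n^3}{n^2} \right) \le \alpha_k \le \beta_k, \quad (0 \le k \le k_n).$$

Summing the above inequality gives

$$\left( 1 - \frac{k_n^3}{n^2} \right) \sum_{k=0}^{k_n} \beta_k \le \sum_{k=0}^{k_n} \alpha_k \le \sum_{k=0}^{k_n} \beta_k. \tag{27}$$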


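By the definition in (20),

$$k_n = \left\lfloor n^{\frac{1}{2} + \varepsilon} \right\rfloor, \quad (0 < \varepsilon < \tfrac{1}{6}),$$

so

$$\frac{k_n^3}{n^2} \le \frac{\left( n^{\frac{1}{2} + \varepsilon} \right)^3}{n^2} = \frac{n^{\frac{3}{2} + 3\varepsilon}}{n^2} = n^{-\frac{1}{2} + 3\varepsilon}.$$

Since $0 < \varepsilon < 1/6$, it follows that

$$\frac{k_n^3}{n^2} \to 0 \quad \text{as } n \to \infty.$$

Letting $n \to \infty$ in (27), we can conclude that

$$\sum_{k=0}^{k_n} \alpha_k \sim \sum_{k=0}^{k_n} \beta_k, \quad (n \to \infty).$$

From (25) and (26), it then follows that

$$E_n \sim E_n^*, \quad (n \to \infty). \tag{28}$$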
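
The asymptotic equivalence (28) can be observed numerically, although the convergence is slow because the two expected values differ by a bounded amount while each grows like $\sqrt{n}$. The sketch below evaluates the series (4) and (6) in floating point; the function names and the truncation tolerance are assumptions introduced for illustration.

```python
def expected_first_game(n):
    """E_n from the finite series 1 + (n-1)/n + (n-1)(n-2)/n^2 + ..."""
    total, term = 0.0, 1.0
    for k in range(n):
        total += term
        term *= (n - k - 1) / n
    return total


def expected_second_game(n, tol=1e-18):
    """E_n^* from the infinite series, truncated once terms fall below tol."""
    total, term, k = 0.0, 1.0, 0
    while term > tol:
        total += term
        k += 1
        term *= n / (n + k)
    return total


for n in (10, 100, 1000, 10000):
    print(n, expected_first_game(n) / expected_second_game(n))
```

The printed ratios stay below one, in agreement with (16), and increase toward one as n grows.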
III.  STIRLING'S FORMULA

A.  THE COMMON ASYMPTOTIC VALUE OF En AND En*

In Chapter II we proved that

$$E_n \sim E_n^*, \quad (n \to \infty).$$

In this chapter we will determine the common asymptotic value of $E_n$
and $E_n^*$.

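From (14) we have

$$\alpha_k \le e^{-\frac{k(k+1)}{2n}} \le \beta_k, \quad (0 \le k \le n-1).$$

Hence,

$$\sum_{k=0}^{k_n} \alpha_k \le \sum_{k=0}^{k_n} e^{-\frac{k(k+1)}{2n}} \le \sum_{k=0}^{k_n} \beta_k. \tag{29}$$

Letting $n \to \infty$ in (29), and using (28) we conclude

$$\sum_{k=0}^{k_n} \alpha_k \sim \sum_{k=0}^{k_n} e^{-\frac{k^2}{2n}}\, e^{-\frac{k}{2n}}, \quad (n \to \infty), \tag{30}$$

$$\sum_{k=0}^{k_n} \beta_k \sim \sum_{k=0}^{k_n} e^{-\frac{k^2}{2n}}\, e^{-\frac{k}{2n}}, \quad (n \to \infty).$$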

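Consider the following inequality:

$$e^{-\frac{k_n}{2n}} \sum_{k=0}^{k_n} e^{-\frac{k^2}{2n}} \le \sum_{k=0}^{k_n} e^{-\frac{k^2}{2n}}\, e^{-\frac{k}{2n}} \le \sum_{k=0}^{k_n} e^{-\frac{k^2}{2n}}. \tag{31}$$

Since $k_n/n = o(1)$ as $n \to \infty$, we conclude from (31) that

$$\sum_{k=0}^{k_n} e^{-\frac{k^2}{2n}}\, e^{-\frac{k}{2n}} \sim \sum_{k=0}^{k_n} e^{-\frac{k^2}{2n}}, \quad (n \to \infty).$$

From (25), (26) and (30) it then follows that

$$E_n \sim \sum_{k=0}^{k_n} e^{-\frac{k^2}{2n}}, \quad (n \to \infty), \tag{32}$$

and

$$E_n^* \sim \sum_{k=0}^{k_n} e^{-\frac{k^2}{2n}}, \quad (n \to \infty). \tag{33}$$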


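To asymptotically estimate the sum in (32) and (33), note the function

$$f(x) = e^{-x^2}$$

is positive and monotone decreasing on $[0, \infty)$. Setting

$$h = \frac{1}{\sqrt{2n}},$$

we then have

$$f(kh) = e^{-(kh)^2} = e^{-\frac{k^2}{2n}}$$

and

$$\int_h^{(k_n+1)h} f(x)\, dx \le \sum_{k=1}^{k_n} h\, f(kh) \le \int_0^{k_n h} f(x)\, dx. \tag{34}$$

Since

$$k_n > n^{\frac{1}{2} + \varepsilon} - 1,$$

and $k_n h = k_n / \sqrt{2n}$, it follows that

$$k_n h > \frac{1}{\sqrt{2n}} \left( n^{\frac{1}{2} + \varepsilon} - 1 \right)$$

or

$$k_n h > \frac{1}{\sqrt{2}} \left( n^{\varepsilon} - \frac{1}{\sqrt{n}} \right).$$

Hence, $k_n h \to \infty$ and $h \to 0$, as $n \to \infty$.
Letting $n \to \infty$ in (34), we conclude

$$\lim_{n \to \infty} \left[ \frac{1}{\sqrt{2n}} \sum_{k=1}^{k_n} e^{-\frac{k^2}{2n}} \right] = \int_0^{\infty} e^{-x^2}\, dx = \frac{\sqrt{\pi}}{2}$$

or

$$\sum_{k=1}^{k_n} e^{-\frac{k^2}{2n}} \sim \frac{\sqrt{2\pi n}}{2}, \quad (n \to \infty). \tag{35}$$

Equations (32), (33) and (35) then imply

$$E_n \sim \frac{\sqrt{2\pi n}}{2}, \quad E_n^* \sim \frac{\sqrt{2\pi n}}{2}, \quad (n \to \infty). \tag{36}$$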
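
The Riemann-sum estimate (35) can be illustrated numerically. In the sketch below the function name and the choice $\varepsilon = 0.1$ are assumptions (any value in the interval (0, 1/6) fits the definition (20)), and the ratio of the truncated Gaussian sum to $\sqrt{2\pi n}/2$ is computed for two values of n.

```python
import math


def gaussian_sum_ratio(n, eps=0.1):
    """Ratio of sum_{k=1}^{k_n} exp(-k^2/(2n)) to sqrt(2*pi*n)/2.

    k_n = floor(n^(1/2 + eps)) as in (20); eps must lie in (0, 1/6).
    """
    k_n = math.floor(n ** (0.5 + eps))
    total = sum(math.exp(-k * k / (2 * n)) for k in range(1, k_n + 1))
    return total / (math.sqrt(2 * math.pi * n) / 2)


print(gaussian_sum_ratio(10**4), gaussian_sum_ratio(10**6))
```

The ratio stays below one, since the sum is a lower Riemann sum of a truncated integral, and moves toward one as n grows. The approach is slow for small $\varepsilon$ because the cutoff $k_n h$ grows only like $n^{\varepsilon}$.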


B.    THE   ASYMPTOTICS  OF  THE  EXPECTED  TOTAL  NUMBER  OF 
DISTINCT  BALLS 

In the first game considered above we defined $T_n$ to be the expected total
number of distinct balls obtained before winning. Let p be the probability of
winning the game, and q = 1 - p be the probability of losing. Recall that one
wins the game if n distinct balls are obtained consecutively and loses
otherwise.

Let the random variable X be the number of times we play the game
up to and including the first win. Then

$$P(X = k) = p\, q^{k-1}, \quad (k = 1, 2, 3, \ldots).$$

The probability p of winning the game, i.e., of getting exactly n distinct
balls before a repetition, is $p = p_n = n!/n^n$, by (2).
The expected value of X is

$$\sum_{k=1}^{\infty} k\, P(X = k) = \sum_{k=1}^{\infty} k\, p\, q^{k-1} = \frac{1}{p} = \frac{n^n}{n!}.$$

This means that, on average, we must play $n^n/n!$ times in order to win.
Therefore, the expected total number of distinct balls $T_n$ obtained before a win
can be represented as follows:

$$T_n = n + \left( \frac{1}{p} - 1 \right) E_n.$$

This is the value of $T_n$ because the expected number of plays is 1/p
($= n^n/n!$). Among these plays, one must be a win, and furthermore that win
must occur on the very last play. All the other plays are losses. When we win,
we draw out n distinct balls, and when we lose, the expected number of
distinct balls is $E_n$. Therefore,

$$T_n = n + \left( \frac{1}{p} - 1 \right) E_n. \tag{37}$$

This expression can be rearranged as

$$T_n = n - E_n + \frac{E_n}{p}.$$
From (36),

$$E_n \sim \frac{\sqrt{2\pi n}}{2}, \quad E_n^* \sim \frac{\sqrt{2\pi n}}{2}, \quad (n \to \infty).$$

So $n - E_n = O(n)$, $(n \to \infty)$. It follows that

$$T_n = O(n) + \frac{E_n}{p}. \tag{38}$$

Now consider $E_n/p$. Since

$$E_n \sim E_n^*, \quad (n \to \infty),$$

we can write

$$\frac{E_n}{p} \sim \frac{1}{2} \left( \frac{E_n}{p} + \frac{E_n^*}{p} \right). \tag{39}$$


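But

$$\begin{aligned}
\frac{E_n}{p} + \frac{E_n^*}{p} &= \frac{1}{p} \left( E_n + E_n^* \right) \\
&= \frac{n^n}{n!} \left( \sum_{k=0}^{n-1} \alpha_k + \sum_{k=0}^{\infty} \beta_k \right) \\
&= \frac{n^n}{n!} \left[ \left( 1 + \frac{n-1}{n} + \frac{n-1}{n} \cdot \frac{n-2}{n} + \cdots + \frac{n-1}{n} \cdot \frac{n-2}{n} \cdots \frac{1}{n} \right) + \left( 1 + \frac{n}{n+1} + \frac{n}{n+1} \cdot \frac{n}{n+2} + \cdots \right) \right] \\
&= \left( \frac{n^{n-1}}{(n-1)!} + \frac{n^{n-2}}{(n-2)!} + \frac{n^{n-3}}{(n-3)!} + \cdots \right) + \left( \frac{n^n}{n!} + \frac{n^{n+1}}{(n+1)!} + \frac{n^{n+2}}{(n+2)!} + \cdots \right),
\end{aligned}$$

and the last expression, after rearranging, can be written as

$$\left( 1 + n + \frac{n^2}{2!} + \cdots + \frac{n^{n-2}}{(n-2)!} + \frac{n^{n-1}}{(n-1)!} \right) + \left( \frac{n^n}{n!} + \frac{n^{n+1}}{(n+1)!} + \frac{n^{n+2}}{(n+2)!} + \cdots \right) = e^n.$$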
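
This identity, that $(E_n + E_n^*)/p$ is exactly the Taylor series of $e^n$, is easy to confirm numerically. The sketch below is an illustration in floating point; the function name and the truncation tolerance for the infinite series are assumptions.

```python
import math


def scaled_expectation_sum(n, tol=1e-18):
    """Compute (E_n + E_n^*) * n^n / n!, which should equal e^n."""
    first, term = 0.0, 1.0
    for k in range(n):                 # finite series for E_n
        first += term
        term *= (n - k - 1) / n
    second, term, k = 0.0, 1.0, 0
    while term > tol:                  # truncated series for E_n^*
        second += term
        k += 1
        term *= n / (n + k)
    return (first + second) * (n ** n / math.factorial(n))


print(scaled_expectation_sum(5), math.exp(5))
```

For n = 1 the check reduces to 1 + (e - 1) = e, since p = 1, $E_1 = 1$ and $E_1^* = e - 1$.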


It follows from (39) that

$$\frac{E_n}{p} \sim \frac{e^n}{2}, \quad (n \to \infty). \tag{40}$$

With this information, letting $n \to \infty$ in (38) yields

$$T_n \sim \frac{e^n}{2}, \quad (n \to \infty). \tag{41}$$


C.  STIRLING'S FORMULA

From (38), (40) and (41),

$$T_n \sim \frac{E_n}{p}, \quad (n \to \infty), \tag{42}$$

i.e.,

$$T_n \sim \frac{n^n}{n!}\, E_n, \quad (n \to \infty). \tag{43}$$

Since

$$E_n \sim \frac{\sqrt{2\pi n}}{2}, \quad (n \to \infty),$$

(43) implies

$$T_n \sim \frac{n^n}{n!} \cdot \frac{\sqrt{2\pi n}}{2}, \quad (n \to \infty).$$

Using (41), we get

$$\frac{e^n}{2} \sim \frac{n^n}{n!} \cdot \frac{\sqrt{2\pi n}}{2}, \quad (n \to \infty), \tag{44}$$

or alternatively,

$$n! \sim \frac{n^n}{e^n} \sqrt{2\pi n}, \quad (n \to \infty).$$

This can also be written as

$$n! \sim n^n e^{-n} \sqrt{2\pi n}, \quad (n \to \infty),$$

and this is Stirling's formula (1).
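
The statement that the ratio of the two sides of Stirling's formula tends to one can be observed directly. The sketch below works with logarithms to avoid overflow for large n; the function name is an assumption, and `math.lgamma(n + 1)` is used for log n!.

```python
import math


def stirling_ratio(n):
    """Return n! / (n^n e^(-n) sqrt(2 pi n)), computed via logarithms."""
    log_exact = math.lgamma(n + 1)  # log(n!)
    log_approx = n * math.log(n) - n + 0.5 * math.log(2 * math.pi * n)
    return math.exp(log_exact - log_approx)


for n in (10, 100, 1000):
    print(n, stirling_ratio(n))
```

The ratio exceeds one by roughly 1/(12n), so it is about 1.008 at n = 10 and within 0.01 percent of one at n = 1000.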


IV.  CONCLUSION 

We have given a new combinatorial or probabilistic derivation of
Stirling's formula. Our derivation also gives another way to interpret
Stirling's formula. To see this, let us consider a specific value for n. Suppose
n = 20, i.e., there are 20 distinct balls in the box. The expected number of
distinct balls $E_{20}$ obtained is

$$E_{20} = 1 + \frac{19}{20} + \frac{19}{20} \cdot \frac{18}{20} + \frac{19}{20} \cdot \frac{18}{20} \cdot \frac{17}{20} + \cdots + \frac{19!}{20^{19}} = 5.293584585.$$

This is fairly close to the asymptotic value of $E_n$ as $n \to \infty$, i.e.,

$$\frac{\sqrt{2\pi \cdot 20}}{2} \approx 5.604991216.$$

The probability of winning a game when n = 20 is

$$p = p_{20} = \frac{20!}{20^{20}},$$

so that the expected number of plays before a win is

$$\frac{1}{p} = \frac{20^{20}}{20!} \approx 43099804.$$
By (37), the expected total number of distinct balls $T_{20}$ obtained before a
win is

$$T_{20} = 20 + (43099804 - 1)(E_{20}) \approx 228152473. \tag{45}$$

The asymptotic formula (41) for $T_n$ gives, for n = 20,

$$\frac{e^{20}}{2} \approx 242582598,$$

and this is of the same order as (45). As n gets larger, the relative agreement
between the exact formula for $T_n$ and the asymptotic formula improves.
Notice that when n = 20 the expected number of plays before a win is
quite large.
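
The n = 20 figures quoted above can be reproduced with a few lines of code. This is a sketch under the formulas (4), (37) and (41); the function name is hypothetical, and the exact non-integer value of 1/p is used rather than its rounded value, which changes $T_{20}$ by less than one.

```python
import math


def first_game_summary(n):
    """Return (E_n, 1/p, T_n from (37), e^n / 2) for the first game."""
    e_n, term = 0.0, 1.0
    for k in range(n):                        # series (4)
        e_n += term
        term *= (n - k - 1) / n
    plays = n ** n / math.factorial(n)        # 1/p = n^n / n!
    t_n = n + (plays - 1) * e_n               # formula (37)
    return e_n, plays, t_n, math.exp(n) / 2   # last entry is (41)


print(first_game_summary(20))
```

The four printed values correspond to $E_{20}$, the expected number of plays, $T_{20}$ and the asymptotic estimate $e^{20}/2$.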

To conclude, we can interpret Stirling's formula as written in (44) in the
following way using (42). As $n \to \infty$ the expected total number of distinct
balls obtained before a win is asymptotic to the expected number of plays
necessary to win times the expected number of distinct balls per play.